Plotting Wrappers: Occupancy Histogram #403

willGraham01 · 2025-02-04T13:04:18Z

Description

What is this PR

Bug fix
Addition of a new feature
Other

Why is this PR needed?

See #388 and the related #5 (which this PR also closes).

What does this PR do?

Adds the plot_occupancy function to the movement.plots module. This function takes in (time, space [x, y])-data and produces a histogram showing the distribution of positions across all time-points.

By default, any additional axes in the input da (DataArray) are collapsed onto the 0th-index, to provide the expected 2D data input. The selection argument can be used by the user to specify alternative coordinates along non-spacetime dimensions to collapse onto instead.
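The collapsing behaviour described above can be sketched roughly as follows. This is not the code from the PR: `collapse_extra_dims` and its `keep` parameter are hypothetical names used only for illustration, and the sketch assumes the spacetime dimensions are called "time" and "space".

```python
import numpy as np
import xarray as xr


def collapse_extra_dims(da, selection=None, keep=("time", "space")):
    """Hypothetical helper: reduce ``da`` to the (time, space) dimensions."""
    # Label-based selection for the dimensions the caller named explicitly
    da = da.sel(dict(selection or {}))
    # Every remaining non-spacetime dimension falls back to its 0th index
    leftover = {dim: 0 for dim in da.dims if dim not in keep}
    return da.isel(leftover)


da = xr.DataArray(
    np.arange(3 * 2 * 2 * 2, dtype=float).reshape(3, 2, 2, 2),
    dims=("time", "space", "keypoints", "individuals"),
    coords={
        "space": ["x", "y"],
        "keypoints": ["nose", "tail"],
        "individuals": ["id_0", "id_1"],
    },
)

default = collapse_extra_dims(da)  # first keypoint, first individual
chosen = collapse_extra_dims(da, {"individuals": "id_1"})  # first keypoint, id_1
```

Selecting by label first and indexing by position afterwards means the caller only has to name the dimensions they care about.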

plot_occupancy returns the usual figure and axes objects, but also returns information from the plotted histogram as its third value. This is mainly because this information is difficult to re-extract from the returned figure and axes. The counts in particular would technically otherwise be lost, since QuadMesh objects (that store histograms) only retain the colour-mapped values (which may blur across bins with similar, but distinct, counts), and not the raw counts in each bin.
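For context, this is what plain matplotlib hands back from `hist2d`, which is the information being exposed here. It is a minimal sketch on synthetic data; the exact structure of plot_occupancy's third return value is not shown in this thread, so the snippet does not assume it.

```python
import matplotlib

matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 100, size=500)  # synthetic x positions
y = rng.uniform(0, 50, size=500)   # synthetic y positions

fig, ax = plt.subplots()
# hist2d returns the raw counts, both sets of bin edges, and the drawn QuadMesh
counts, xedges, yedges, mesh = ax.hist2d(
    x, y, bins=[10, 5], range=[[0, 100], [0, 50]]
)
plt.close(fig)
```

Here `counts` has one row per x bin and one column per y bin, and each edge array has one more entry than the number of bins along its axis.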

References

Closes #388. ~~Additionally, this hopefully goes some way towards addressing #5, since we are returning the histogram data as the 3rd return value.~~
Closes #5 too.

How has this PR been tested?

Addition of tests to cover expected functionality, and possible edge cases.

Is this a breaking change?

No

Does this PR require an update to the documentation?

#410

Checklist:

The code has been tested locally
Tests have been added to cover all new functionality
The documentation has been updated to reflect any changes
The code has been formatted with pre-commit

willGraham01 · 2025-02-11T11:27:41Z

@sfmig your comment on #5 indicates that it would be useful to have certain bits of information from the plot as outputs from this kind of function. Currently I'm just exposing the other hist2d outputs (that are suppressed by the wrapper otherwise) to the user here, not sure if you had more detailed outputs in mind when writing your comment.

But if so, we can also close #5 with this PR too.

sfmig · 2025-02-11T14:40:08Z

thanks for checking @willGraham01 !

The point of that comment was that often you not only want the figure, but also the 2D array with the bin counts. From your comment ...

> Currently I'm just exposing the other hist2d outputs

seems like that is covered? So I think we can close #5 yay 😄 🚀

(Just fyi I vaguely remember this was something Sepi requested but not sure)

willGraham01 · 2025-02-11T14:47:20Z

> (Just fyi I vaguely remember this was something Sepi requested but not sure)

I hope it is b/c otherwise I've just wasted 5 hours of Niko's grant 🤭 😂 But will mark #5 as closable by this 🥳

niksirbi · 2025-02-12T18:34:30Z

I will finish reviewing this tomorrow, but I can already do some cool things with this!

See source code for this figure

import numpy as np
from matplotlib import pyplot as plt

from movement import sample_data
from movement.plots import plot_occupancy

# Load the sample dataset 
ds = sample_data.fetch_dataset("DLC_two-mice.predictions.csv")

# Compute the centroid of all keypoints
centroid_position = ds.position.mean("keypoints")

image = plt.imread(ds.attrs["frame_path"])

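# Construct bins of size 30x30 pixels that cover the entire image.
# hist2d expects [x_edges, y_edges]: x spans the image width (shape[1])
# and y spans the image height (shape[0]).
bin_pix = 30
bins = [
    np.arange(0, image.shape[1] + bin_pix, bin_pix),
    np.arange(0, image.shape[0] + bin_pix, bin_pix),
]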

# Initialize the figure and axis
fig, ax = plt.subplots()

# Show the image
ax.imshow(image)

# Plot the occupancy 2D histogram for one individual
_, _, hist_data = plot_occupancy(
    da=centroid_position,
    selection={"individuals": "individual1"},
    ax=ax,
    cmap="viridis",
    alpha=0.5,
    bins=bins,
    cmin=3,      # Set the minimum shown count
    norm="log"
)

# Set the axis limits to match the image
ax.set_xlim(0, image.shape[1])
ax.set_ylim(image.shape[0], 0)

niksirbi

Thanks @willGraham01!

I’ve added some comments, mostly about aligning the function signature (and default behavior) with that of plot_trajectory().

Regarding your discussion with Sofía:
Yes, this approach technically meets the requirement of also obtaining the occupancy data as a 2D array, which is excellent. However, it can be slightly awkward to always rely on the plotting function when all you need is the occupancy array. There may be scenarios where the user only wants the 2D occupancy array—without the plot—for comparisons with neural data. From that perspective, it might be more intuitive to have a dedicated compute_occupancy function that returns both the 2D array and the bin edges. We could discuss the best data structure to return—whether that’s an xr.DataArray or multiple NumPy arrays, similar to hist2d.

In any case, I suggest merging this PR with just plot_occupancy (after addressing my comments) and leaving compute_occupancy for a future PR. We just need to ensure that both functions produce consistent histogram data, i.e. compute_occupancy should use the same underlying method as hist2d.
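A rough sketch of what such a function could look like, purely for discussion: `compute_occupancy` does not exist in this PR, so the name, signature, and NaN handling below are assumptions. It calls `np.histogram2d`, which to my knowledge is what matplotlib's `hist2d` uses internally, so the counts should match the plotted ones for the same bins.

```python
import numpy as np


def compute_occupancy(x, y, bins=10):
    """Hypothetical helper: 2D bin counts and edges, without any plotting."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    # Drop positions where either coordinate is missing
    valid = ~(np.isnan(x) | np.isnan(y))
    counts, xedges, yedges = np.histogram2d(x[valid], y[valid], bins=bins)
    return counts, xedges, yedges


x = np.array([0.5, 1.5, 1.5, np.nan, 2.5])
y = np.array([0.5, 0.5, 1.5, 1.0, np.nan])
edges = [np.array([0.0, 1.0, 2.0, 3.0]), np.array([0.0, 1.0, 2.0])]

counts, xedges, yedges = compute_occupancy(x, y, bins=edges)
```

Returning NumPy arrays mirrors `hist2d`; wrapping the counts in an xr.DataArray with the bin centres as coordinates would be the alternative mentioned above.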

willGraham01 · 2025-02-17T11:26:37Z

The function signature is now in line with plot_trajectory, and I've implemented the default behaviour of "take centroid of keypoints, then aggregate over individuals".

Note that the aggregation over individuals works by "stacking" the input DataArray's "individuals" axis along the "time" axis. E.g. a (10, 2, 4) time-space-individuals array can be reshaped and counted as a (40, 2) array. This:

Makes it robust against NaN values (NaN values can be filtered on a per-position basis, since xarray's dropna currently doesn't support dropping NaNs along multiple axes simultaneously).
Prevents any issues with bin edges not aligning. If we count each individual first and then sum the counts, passing bin counts such as bins=[30, 20] gives each individual its own bin edges, derived from that individual's data range, so the per-individual histograms cannot be summed bin-for-bin. By stacking first, we avoid this issue (note that passing explicit bin edges works in both cases, regardless).
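The two points above can be illustrated with plain NumPy. This is a sketch on synthetic data rather than the xarray-based code in the PR; the array layout and variable names are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(1)
# (time, space, individuals) = (10, 2, 4); each individual occupies its own region
positions = rng.uniform(0, 1, size=(10, 2, 4)) + np.arange(4) * 5.0
positions[3, 0, 2] = np.nan  # one missing x value for the third individual

# Stack individuals along time: (10, 2, 4) -> (10, 4, 2) -> (40, 2)
stacked = np.moveaxis(positions, 2, 1).reshape(-1, 2)
# Filter NaNs per position: drop any row with a missing coordinate
stacked = stacked[~np.isnan(stacked).any(axis=1)]

# With integer bin counts, each individual would get its own bin edges
_, xedges_a, _ = np.histogram2d(
    positions[:, 0, 0], positions[:, 1, 0], bins=[30, 20]
)
_, xedges_b, _ = np.histogram2d(
    positions[:, 0, 1], positions[:, 1, 1], bins=[30, 20]
)
edges_align = np.allclose(xedges_a, xedges_b)

# Counting the stacked data once yields a single, shared set of edges
counts, xedges, yedges = np.histogram2d(
    stacked[:, 0], stacked[:, 1], bins=[30, 20]
)
```

In this example `edges_align` is False because the two individuals cover different ranges, while the stacked histogram accounts for all 39 valid positions on one grid.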

niksirbi

Thanks @willGraham01! I like your approach to aggregating, it's entirely sensible.

I left some final comments to be addressed. I'll pre-approve this PR so you have the freedom to merge it as soon as you're done addressing those comments.

niksirbi · 2025-02-17T18:02:40Z

movement/plots/occupancy.py

+    if "individuals" in data.dims:
+        data = data.stack(
+            {"new": ("time", "individuals")}, create_index=False
+        ).rename({"new": "time"})


In some cases I get the following warning here:

/Users/nsirmpilatze/Code/NIU/movement/movement/plots/occupancy.py:157: UserWarning: rename 'new' to 'time' does not create an index anymore. Try using swap_dims instead or use set_index after rename to create an indexed coordinate.
  ).rename({"new": "time"})

willGraham01 force-pushed the wgraham-388-occupancy-histogram branch from 3fe51e8 to 2cc312a Compare February 4, 2025 13:05

willGraham01 changed the title ~~Plot wrapper for occupancy histogram~~ Plotting Wrappers: Occupancy Histogram Feb 4, 2025

willGraham01 linked an issue Feb 4, 2025 that may be closed by this pull request

Plotting wrappers: Occupancy Heatmap #388

Closed

willGraham01 mentioned this pull request Feb 5, 2025

Collapse dimensions common functionality for plot wrappers #405

Open

7 tasks

willGraham01 force-pushed the wgraham-388-occupancy-histogram branch from 415f620 to e412849 Compare February 11, 2025 11:13

willGraham01 marked this pull request as ready for review February 11, 2025 11:14

willGraham01 mentioned this pull request Feb 11, 2025

Example for "quick plot" functions #410

Open

willGraham01 requested a review from niksirbi February 11, 2025 11:24

niksirbi requested changes Feb 13, 2025

willGraham01 added 14 commits February 13, 2025 14:54

Basic histogram plot created

71b88b1

Allow kwargs to go to underlying function

0d08661

Remove manual debugging from package module

fb61a44

Write test, but it fails. But can't figure out why it fails...

448e3af

Additional return values to help extract histogram information

a4e5651

Test missing dims and entirely NAN values

03cb1db

Check that new / existing axes are respected

c3c77ae

Default units to pixels

c8cf1b4

SonarQube recommendations

e042f7c

Comply with new plot wrapper standards

be0d22b

Add test for default selection case

1614ab4

Add check for incorrect dims after squeezing

76a1973

Remove tests to start afresh

e3e690d

Merge branch 'main' into wgraham-388-occupancy-histogram

8ecd914

willGraham01 force-pushed the wgraham-388-occupancy-histogram branch from e412849 to 8ecd914 Compare February 14, 2025 14:45

Move trajectory tests into plots/ testing folder

7dd0ebe

niksirbi mentioned this pull request Feb 14, 2025

Several minor bugs in plot_trajectory #417

Open

willGraham01 added 4 commits February 17, 2025 11:02

Write tests for plot_occupancy

41055b6

Add examples in docstring for kwargs

7072bf5

SonarQube is confused, but fine

0e6d5dd

Test that ax argument doesn't complain

6b52cfc

willGraham01 requested a review from niksirbi February 17, 2025 11:26

niksirbi approved these changes Feb 17, 2025

willGraham01 and others added 4 commits February 18, 2025 09:03

Apply suggestions from code review

63c11ae

Co-authored-by: Niko Sirmpilatze <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

fc2d3f3

for more information, see https://pre-commit.ci

Apply suggestions from code review

61fde41

Close hanging matplotlib plots inside occupancy tests

4026105

willGraham01 enabled auto-merge February 18, 2025 09:13

Merge branch 'main' into wgraham-388-occupancy-histogram

4ed833b

willGraham01 added this pull request to the merge queue Feb 18, 2025

Merged via the queue into main with commit 8fbc991 Feb 18, 2025
28 checks passed

willGraham01 deleted the wgraham-388-occupancy-histogram branch February 18, 2025 09:48
